Using ELAN for annotating sign language corpora in a team setting

نویسندگان

  • Onno Crasborn
  • Han Sloetjes
چکیده

ELAN is a multimedia annotation tool that is employed in many sign language corpus projects. It is a standalone desktop application that, like many other desktop applications, principally is a single user, document oriented application. In many scenarios this is still perfectly satisfactory but in large-scale corpus projects, involving many collaborators who are working on the same documents, the problem arises of how to resolve edit conflicts and how to prevent undesirable modifications to parts of the document. The Corpus NGT project is such a project and this paper describes the challenges that arose in the process of its creation as well as in the exploitation of this large collection of annotation documents. It outlines recent and possible future development of ELAN and alternate solutions that have been explored and applied.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Conventions for sign and speech transcription of child bimodal bilingual corpora in ELAN.

This article extends current methodologies for the linguistic analysis of sign language acquisition to cases of bimodal bilingual acquisition. Using ELAN, we are transcribing longitudinal spontaneous production data from hearing children of Deaf parents who are learning either American Sign Language (ASL) and American English (AE), or Brazilian Sign Language (Libras, also referred to as Língua ...

متن کامل

Towards Automatic Sign Language Annotation for the ELAN Tool

A new interface to the ELAN annotation software that can handle automatically generated annotations by a sign language recognition and translation framework is described. For evaluation and benchmarking of automatic sign language recognition, large corpora with rich annotation are needed. Such databases have generally only small vocabularies and are created for linguistic purposes, because the ...

متن کامل

Empowering ELAN with N-gram Analytics for Corpora

American Sign Language, the preferred language of the Deaf community in the USA is its own language; complete with a rich collection of grammatical features. The DePaul University team has been working on an automatic English/ASL translator implemented as a 3D avatar in order to facilitate better communication between hearing and deaf people. Animations suffered from various timing inconsistenc...

متن کامل

Using sign language corpora as bilingual corpora for data mining: Contrastive linguistics and computer-assisted..

More and more sign languages nowadays are now documented by large scale digital corpora. But exploiting sign language (SL) corpus data remains subject to the time consuming and expensive manual task of annotating. In this paper, we present an ongoing research that aims at testing a new approach to better mine SL data. It relies on the methodology of corpus-based contrastive linguistics, exploit...

متن کامل

BosphorusSign: A Turkish Sign Language Recognition Corpus in Health and Finance Domains

There are as many sign languages as there are deaf communities in the world. Linguists have been collecting corpora of different sign languages and annotating them extensively in order to study and understand their properties. On the other hand, the field of computer vision has approached the sign language recognition problem as a grand challenge and research efforts have intensified in the las...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017